Incorporating Variation of Model-specific Score Distribution in Speaker Verification Systems
نویسندگان
چکیده
It has been shown that the authentication performance of a biometric system is dependent on the models/templates specific to a user. As a result, some users may be more easily recognized or impersonated than others. The various categories of users have been characterized by Doddington et al.(1988). We refer to this unbalanced performance across users as the Doddington’s zoo effect. In the context of fusion, we argue that this effect is system-dependent, i.e., a user model that is easily impersonated (a lamb) in one system may be easily recognized in another system (a sheep). While in principle, fusion system could be trained to cope with the changing animal behavior of users from system to system, the lack of training data makes it impossible. We believe that one major cause of the Doddington’s zoo effect is the variation of class conditional scores from one speaker model to another. We propose a two-level fusion framework that effectively realizes a fusion classifier adapted to each user. Firstly, one applies a client-specific (or model-specific) score normalization procedure to each of the system outputs to be combined. Then, one feeds the resulting normalized outputs to a fusion classifier (common to all users) as input to obtain a final combined score. Two existing model-specific score normalization procedures are considered in this framework, i.e. Fand Z-norms. In addition to them, a novel score normalization method called model-specific log-likelihood ratio (MS-LLR) is also proposed. While Z-norm is impostor-centric, i.e., it makes use of only the impostor score statistics, F-norm and the proposed MS-LLR are client-impostor centric, i.e., they consider both the client and impostor score statistics simultaneously. Our findings based on the XM2VTS and the NIST2005 databases show that when client-impostor centric normalization procedures are used to implement the proposed two-level fusion framework, the resulting fusion Norman Poh and Josef Kittler are with CVSSP, University of Surrey, Guildford, GU2 7XH, Surrey, UK. E-mails: [email protected], [email protected]
منابع مشابه
Using Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems
Speaker verification from talking a few words of sentences has many applications. Many methods as DTW, HMM, VQ and MQ can be used for speaker verification. We applied MQ for its precise, reliable and robust performance with computational simplicity. We also used pitch frequency and log gain contour for further improvement of the system performance.
متن کاملStudy on the effects of intrinsic variation using i-vectors in text-independent speaker verification
Speaker verification performance is adversely affected by mismatches between training and testing data in intrinsic variations. This paper explores how recent technologies focused on modeling the total variability behave in addressing the effects of intrinsic variation in speaker verification. The effects of intrinsic variation are investigated from six aspects including speaking style, speakin...
متن کاملA Review of Various Score Normalization Techniques for Speaker Identification System
This paper presents an overview of a state-of-the-art text-independent speaker verification system using score normalization. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly speech parameterization used in speaker verification, namely, cepstral analysis, is detailed. Normalization of scores is then explai...
متن کاملUsing Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems
Speaker verification from talking a few words of sentences has many applications. Many methods as DTW, HMM, VQ and MQ can be used for speaker verification. We applied MQ for its precise, reliable and robust performance with computational simplicity. We also used pitch frequency and log gain contour for further improvement of the system performance.
متن کاملAn implementation and evaluation of an on-line speaker verification system for field trials
This paper presents a HMM-based speaker verification system which was implemented for a field trial. One of the challenges for moving HMM from speech recognition to speaker verification is to understand the HMM score variation and to define a proper measurement which is comparable across speech samples. In this paper we define two basic verification measurements, a qualifier-based measurement a...
متن کامل